Utilizing prosody for unconstrained morpheme recognition
نویسندگان
چکیده
Speech recognition systems for languages with a rich in ectional morphology (like German) su er from the limitations of a word{based full{form lexicon. Although the morphological and acoustical knowledge about words is coded implicitly within the lexicon entries (which are usually closely related to the orthography of the language at hand) this knowledge is usually not explicitly available for other tasks (e.g. detecting OOV words). This paper presents an HMM{based `word' recognizer that uses morphemes on the string level for recognizing spontaneous German conversational speech (Verbmobil corpus). The system has no explicit word knowledge but uses a morpheme{bigram to capture the German word and sentence structure to some extent. The morpheme recognizer is tightly coupled with a prosodic classi er in order to compensate for some of the additional ambiguity introduced by using morphemes instead of words. Although the recognizer's morpheme accuracy of 85:3% is comparable to that of our word{based decoder (word accuracy 86%) until now the bene t of introducing the prosodic classi er is not yet clear.
منابع مشابه
Large Vocabulary Continuous Speech Recognition for Estonian Using Morpheme Classes
This paper describes development of a large vocabulary continuous speaker independent speech recognition system for Estonian. Estonian is an agglutinative language and the number of different word forms is very large, in addition, the word order is relatively unconstrained. To achieve a good language coverage, we use pseudo-morphemes as basic units in a statistical trigram language model. To im...
متن کاملLarge Vocabulary Continuous Speech Recognition for Estonian Using Morphemes and Classes
This paper describes development of a large vocabulary continuous speaker independent speech recognition system for Estonian. Estonian is an agglutinative language and the number of different word forms is very large, in addition, the word order is relatively unconstrained. To achieve a good language coverage, we use pseudo-morphemes as basic units in a statistical trigram language model. To im...
متن کاملEvidence Theory-Based Multimodal Emotion Recognition
Automatic recognition of human affective states is still a largely unexplored and challenging topic. Even more issues arise when dealing with variable quality of the inputs or aiming for real-time, unconstrained, and person independent scenarios. In this paper, we explore audio-visual multimodal emotion recognition. We present SAMMI, a framework designed to extract real-time emotion appraisals ...
متن کاملRobust Iris Recognition in Unconstrained Environments
A biometric system provides automatic identification of an individual based on a unique feature or characteristic possessed by him/her. Iris recognition (IR) is known to be the most reliable and accurate biometric identification system. The iris recognition system (IRS) consists of an automatic segmentation mechanism which is based on the Hough transform (HT). This paper presents a robust IRS i...
متن کاملSpoken Keyword Rescoring and Document Retrieval for Low-resource Languages
For languages that have adequate data for automatic speech recognition (ASR), many keyword search(KWS) and document retrieval(SDR) systems have been developed with near-optimal performance. However, lacking of sufficient training data to produce high accuracy transcript, identification and retrieval of queries in speech data from low-resources languages remains challenging. To compensate for th...
متن کامل